DEPARTMENT OF COMPUTER SCIENCE Fast Text Searching With Errors
نویسندگان
چکیده
Searching for a pattern in a text file is a very common operation in many applications ranging from text editors and databases to applications in molecular biology. In many instances the pattern does not appear in the text exactly. Errors in the text or in the query can result from misspelling or from experimental errors (e.g., when the text is a DNA sequence). The use of such approximate pattern matching has been limited until now to specific applications. Most text editors and searching programs do not support searching with errors because of the complexity involved in implementing it. In this paper we present a new algorithm for approximate text searching which is very fast and very flexible. We believe that the new algorithm will find its way to many searching applications and will enable searching with errors to be just as common as searching exactly.
منابع مشابه
An Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification
The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...
متن کاملAn Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification
The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...
متن کاملA New Approach for Text Documents Classification with Invasive Weed Optimization and Naive Bayes Classifier
With the fast increase of the documents, using Text Document Classification (TDC) methods has become a crucial matter. This paper presented a hybrid model of Invasive Weed Optimization (IWO) and Naive Bayes (NB) classifier (IWO-NB) for Feature Selection (FS) in order to reduce the big size of features space in TDC. TDC includes different actions such as text processing, feature extraction, form...
متن کاملDevelopment Hough transform to detect straight lines using pre-processing filter
Image recognition is one of the most important field in image processing that in recent decades had much attention .Due to expansion of related fields with image processing and various application of this science in machine vision, military science, geography, aerospace and artificial intelligence and lots of other aspects, out stand the importance of this subject.One of the most important aspe...
متن کاملDevelopment Hough transform to detect straight lines using pre-processing filter
Image recognition is one of the most important field in image processing that in recent decades had much attention .Due to expansion of related fields with image processing and various application of this science in machine vision, military science, geography, aerospace and artificial intelligence and lots of other aspects, out stand the importance of this subject.One of the most important aspe...
متن کامل